Parsing Free Word-Order Languages in Polynomial Time
نویسندگان
چکیده
Long-Distance Scrambling is a word-order phenomenon which is “doubly unbounded” in that (i) more than one element can move, and (ii) movement can be unbounded. In (Becker et al., 1991), we argue that scrambling is beyond TAG by assuming that elementary trees express a complete predicate-argument structure. In (Becker et al., 1992), we show that no formalism in the class LCFRS (which includes TAG) can derive scrambling. (Becker et al., 1991) proposes two variants of the TAG formalism which can derive scrambling while still preserving most of the desirable properties of TAGs (i.e., an extended domain of locality and the factoring of recursion). However, little is known about the formal and computational properties of those systems. (Rambow, 1994) proposes V-TAG, which is closely related to one of the previously proposed varaiants, but redefines the derivation relation. V-TAG can derive the relevant set of sentences and also cases where scrambling co-occurs with long-distance topicalization (a separate linguistic phenomenon also found in English, in which a single element moves into sentenceinitial position):
منابع مشابه
تأثیر ساختواژهها در تجزیه وابستگی زبان فارسی
Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...
متن کاملDiachronic Trends in Word Order Freedom and Dependency Length in Dependency-Annotated Corpora of Latin and Ancient Greek
One easily observable aspect of language variation is the order of words. In human and machine natural language processing, it is often claimed that parsing freeorder languages is more difficult than parsing fixed-order languages. In this study on Latin and Ancient Greek, two wellknown and well-documented free-order languages, we propose syntactic correlates of word order freedom. We apply our ...
متن کاملDeveloping a Minimalist Parser for Free Word Order Languages with Discontinuous Constituency
We propose a parser based on ideas from the Minimalist Programme. The parser supports free word order languages and simulates a human listener who necessarily begins sentence analysis before all the words in the sentence have become available. We first sketch the problems that free word order languages pose. Next we discuss an existing framework for minimalist parsing, and show how it is diffic...
متن کاملRobust and efficient semantic parsing of free word order languages in spoken dialogue systems
This paper presents a semantic parser for spoken dialogue systems. The parser is designed especially for the analysis of free word order languages by providing a feature called orderindependent matching. We describe how this feature allows writing of rules for free word order languages in an elegant way (using German as example language) and how it increases the robustness against speech recogn...
متن کاملPartially Ordered Multiset Context-free Grammars and Free-word-order Parsing
We present a new formalism, partially ordered multiset context-free grammars (pomsCFG), along with an Earley-style parsing algorithm. The formalism, which can be thought of as a generalization of context-free grammars with partially ordered right-hand sides, is of interest in its own right, and also as infrastructure for obtaining tighter complexity bounds for more expressive context-free forma...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/cmp-lg/9411008 شماره
صفحات -
تاریخ انتشار 1994